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Abstract. Visibly pushdown transducers form a subclass of pushdown transduc- 
ers that (strictly) extends finite state transducers with a stack. Like visibly push- 
down automata, the input symbols determine the stack operations. In this paper, 
we prove that functionality is decidable in PSpace for visibly pushdown trans- 
ducers. The proof is done via a pumping argument: if a word with two outputs 
has a sufficiently large nesting depth, there exists a nested word with two outputs 
whose nesting depth is strictly smaller. The proof uses technics of word com- 
binatorics. As a consequence of decidability of functionality, we also show that 
equivalence of functional visibly pushdown transducers is ExpTlME-C. 

1 Introduction 

In fT], it has been shown that visibly pushdown languages (VPL) form a robust subclass 
of context-free languages. This class strictly extends the class of regular languages and 
still enjoys strong properties: closure under all Boolean operators and decidability of 
emptiness, universality, inclusion and equivalence. On the contrary, context-free lan- 
guages are not closed under complement nor under intersection, moreover universality, 
inclusion and equivalence are all undecidable. 

Visibly pushdown automata (VPA), that characterize VPL, are obtained as a restric- 
tion of pushdown automata. In these automata the input symbol determines the stack 
operation. The input alphabet is partitioned into call, return and internal symbols: if a 
call is read, the automaton must push a symbol on the stack; if it reads a return, it must 
pop a symbol; and while reading an internal symbol, it can not touch, not even read, 
the stack. Visibly pushdown transducers have been introduced in [ 11 1. They form a sub- 
class of pushdown transducers, and are obtained by adding output to VPA: each time 
the VPA reads an input symbol it also outputs a letter They allow for e-transitions that 
can produce outputs. In this paper, we consider visibly pushdown transducers where 
this operation is not allowed. Moreover, each transition can output not only a single 
letter but a word, and no visibly restriction is imposed on this output word. Therefore 
in the sequel we call the transducers of 1 1 1 1 e-VPTs, and VPTs will denote the visibly 
pushdown transducers considered here. 

Consider the VPT T of Figure[T] Call (resp. return) symbols are denoted by c (resp. 
r). The domain of T is Dom{T) = {ci(c2)"c3r3(r2)"ri | n G N}. For each word 
of Dom{T), there are two accepting runs, corresponding respectively to the upper and 
lower part of T. For instance, when reading ci, it pushes 71 and produces either d 



(upper part) or dfc (lower part). By following the upper part (resp. lower part), it pro- 
duces words of the form dfcab{cabcab)"gh (resp. dfc{abc)'^ab{cab)"gh). Therefore T 
is functional. 




In this paper, we prove that the problem of determining if a VPT transduction is 
functional is decidable. In particular, our algorithm is in PS PACE. Deciding functional- 
ity is one of the main problem in transduction theory as it makes deciding equivalence 
of functional transducers possible. Both problems are undecidable for pushdown trans- 
ductions. Our proof relies on a pumping argument; if a word is long enough and has two 
outputs, we show that there is a strictly shorter word with two outputs. We use technics 
of word combinatorics and in particular, a strong result proved in [SJ. As a consequence, 
we show that the equivalence problem for VPTs is ExpTlME-C. 

Related Work e-VPTs have been introduced in fTTl. In contrast to VPTs, they allow 
for e-transitions that produce outputs, so that an arbitrary number of symbols can be 
inserted. Moreover, each transition of a VPT can output a word while each transition of 
an e-VPT can output a single letter only. The VPTs we consider here are strictly less 
expressive than e-VPTs, but functionality and equivalence of functional transducers are 
decidable, which is not the case for e-VPTs. 

The functionality problem for finite state transducers has been extensively studied. 
The first proof of decidability was given by Schiitzenberger in IJ2J . and later in 13). 
As the proof we give here, the proof of Schiitzenberger relies on a pumping lemma 
for functionality. The first PTiME upper bound has been proved in [T|, and an efficient 
procedure has been given in f2\ . 

Deciding equivalence of deterministic (and therefore functional) VPTs is in PTiME 
ifTSl . However, functional VPTs are strictly more expressive than deterministic VPTs. 
In particular, non-determinism is often needed to model functional transformations 
whose current production depends on some input which may be arbitrary far away from 
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the current input. For instance, the transformation that swaps the first and the last input 
symbols is functional but non-determinism is needed to guess the last input. 

Ordered trees over an arbitrary finite alphabet S can be naturally represented by 
well nested words over the structured alphabet E x {c} U S x {r}. As VPTs can 
express transductions from well words to well nested words, they are therefore well- 
suited to model tree tranformations. We distinguish ranked trees from unranked trees, 
whose nodes may have an arbitrary number of ordered children. Ranked tree trans- 
ducers have received a lot of attention. Most notably, tree transducers [4] and macro 
tree transducers |6| have been proposed and studied. They are incomparable to VPTs 
however, as they allow for copy, which is not the case of VPTs, but cannot define any 
context-free language as codomain, what VPTs can do. Functionality is known to be 
decidable in PTiME for tree transducers [T3l. More generally, finite-valuedness (and 
equivalence) of tree transducers is decidable [14J. There have been several attempts to 
generalize ranked tree transducers to unranked tree transducers 1191 101 . As mentioned in 
|[5l, it is an important problem to decide equivalence for unranked tree transformation 
formalisms. However, there is no obvious generalization of known results for ranked 
trees to unranked trees, as unranked tree transformations have to support concatena- 
tion of tree sequences, making usual binary encodings of unranked trees badly suited. 
Considering classical ranked tree transducers, their ability to copy subtrees is the main 
concern when dealing with functionality. However for VPTs, it is more their ability to 
concatenate sequences of trees which makes this problem difficult, and which in a way 
led us to word combinatorics. To the best of our knowledge, VPTs consist in the first 
(non-deterministic) model of unranked tree transformations for which functionality and 
equivalence of functional transformations is decidable. 

Organization of the paper In Section |2] we define visibly pushdown transducers as a 
extension of visibly pushdown automata. In Section [3] we recall some notion of word 
combinatorics. In Section |4] we give a reduction of functionality to a system of word 
equations. In Section |5] we prove a pumping lemma that preserves non-functionality. 
Finally, we give a PSpace algorithm for functionality is Section|6]and prove the EXP- 
TlME completeness of equivalence. 

2 Visibly Pushdown Transducers 

Let be a finite alphabet partitioned into two disjoint sets Ec and Er denoting respec- 
tively the call and return alphabet^ We denote by S* the set of words over S and by e 
the empty word. The length of a word u is denoted by \u\. The set of well nested words 
Z"*^ is the smallest subset of S* such that e e Z"*^ and for all c e S", all r e Z"", all 
u, w e Z^n' cwr e and uv G The height of a well nested word is inductively 
defined by h{e) = 0, h{cur) = 1 + h{u), and h{uv) — max{h{u),h{v)). 

Visibly Pushdown Languages A visibly pushdown automaton (VPA) |[T] on finite words 
over Z is a tuple A — {Q, I, F, F, 5) where Q is a finite set of states, I Q, respec- 
tively F C Q, the set of initial states, respectively final states, F the (finite) stack 

' In contrast to 1 1 1, we do not consider internal symbols i, as they can be simulated by a (unique) 
call Ci followed by a (unique) return ri 
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alphabet, and S = Sc ^ Sr where 6c Q Q x Sc T x Q the call transitions, 
5r ^ Q X Sr >^ r X Q aie the return transitions. On a call transition (q, a, (?', 7) S 5c, 
7 is pushed onto the stack and the control goes from q to q' . On a return transition 
{q, 7, a, <?') S (5r, 7 is popped from the stack. Stacks are elements of F*, and we de- 
note by _L the empty stack. A run of a VPA A on a word w ~ ai ... a/ is a sequence 
crfc)}o<fc<^ where q^ is the state and G F* is the stack at step k, such that 
(?o € /, CTp = -L, and for each k<l,we have either: (i) {q^, Ofe+i, 7, qk+i) G Sc and 
CTfe+i = <^kl\ (m) (9fc,afe+i,7,9/c+i) e 5r, and cr^ crfc+17. A run is accepting if 
qi G F and cr/ = _L. A word ui is accepted by ^ if there exists an accepting run of A 
over w. Note that it is necessarily well nested. L{A), the language of A, is the set of 
words accepted by A. A language L over Z" is a visibly pushdown language if there is 
a VPA A over T such that L{A) = L. 

In contrast to [1 ] and to ease the notations, we do not allow transitions on the empty 
stack. Therefore the words accepted by a VPA are well-nested (every call symbol has a 
matching return symbol and conversely). 

Visibly Pushdown Transducers As finite-state transducers extend finite-state automata 
with outputs, visibly pushdown transducers extend VPA with outputs. To simplify nota- 
tions, we suppose that the output alphabet is S, but our results still hold for an arbitrary 
output alphabet. 

Definition 1 (Visibly puslidown transducers). A visibly pushdown transduce^(\/PT) 
on finite words over is a tuple F — {Q, I, F, F, 6) where Q is a finite set of states, 
/ C Q is the set of initial states, F C Q the set of final states, F is the stack al- 
phabet, S = 6c^ 6r the transition relation, with 6c ^ Q x Sc x S* x F x Q, 

6r QQ X Sr X S* X F X Q. 

A configuration of a VPT is a pair (g, cr) £ Q x F* . A run of T on a word u = 
ai . . . ai £ S* from a configuration {q, a) to a configuration {q', cr') is a finite sequence 
p = {{qi,cri)}o<k<i such that qo ^ q, a ^ ctq, q' = g„, cr' = cr„ and for all i G 
{1, ...,/}, there exist Vi G S* and ji £ F such that {qi — 1, a.i, Vi, ji, qi) € 6c and 
either G Sc and ai = cri-i^fi, or G Sr and cr^^i — ai^i. The word v ~ vi . . .vi 

is called an output of p. We write {q, a) {q' , cr') when there exists a run on u from 
(9, cr) to ((7', cr') producing v as output. The transducer T defines a word binary relation 

[T] = {{u,v)\3q£l,p£ F, {q, ±) ^ {p, ±)}. 

The domain of T, resp. the codomain of T, denoted resp. by Dom{F) and CoDom{F), 
is the domain of |T]], resp. the codomain of |T|. Note that the domain of T contains 
only well nested words, which is not the case of the codomain in general. 

In this paper, we prove the following theorem: 

Tlieorem 1. Functionality of MPT s is decidable in PSpace. 
The rest of the paper is devoted to the proof of this theorem. 

^ In contrast to 111], there is no producing e- transitions (inserting transitions) but a transition 
may produce a word and not a single symbol 
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3 Preliminaries on Word Combinatorics 

The size of a word x is denoted by \x\. Given two words x,y € 17*, we write x ^ y 
if X is a prefix of y. If we have x ^ y, then we note x^^y the unique word z such that 
y — xz. A word a; e Z"* is primitive if there is no word y such that \y\ < \x\ and 
X ^ y*. The primitive root of a word x ^ U* is the (unique) primitive word y such that 
a: G y*. In particular, if x is primitive, then its primitive root is x. Two words x and y 
are conjugate if there exists z G Z"* such that xz = zy. It is well-known that two words 
are conjugate iff there exist ii, ^2 G ^* such that a; = tit2 and ?/ = t2ii. Two words 
x,y £ S* commute iff a;?/ = yx. 

Lemma 1 (folklore). Let x,y £ S* and n, to G N. 

1. if X and y commute, then x,y £ z* for some z £ E*. Moreover, if xy is primitive, 
then X — e or y — e; 

2. if x" and have a common subword of length at least \x\ + \y \ — d (d being the 
greatest common divisor of\x\ and \y\), then their primitive roots are conjugate. 

Proof. The first assertion is folklore. For the second, there exists z £ S* and a, [3 > 
such that X = and y = z^. If x and y are non-empty, then a, /3 > and z ^ e. Thus 
xy = z"+'^, which contradicts the primitivity of xy. 

Lemma 2 (Hakala,Kortelainen, Theorem7of [8|).Lef wq, wi, Wmii'r, Vq, wo,wi,Wm, yjj, £ 
E* andi £ N. IfvQ{viyvm(vj)'''VQ = woiwiYwrniw-YwQholds for alii £ {0, 1, 2, 3}, 
then it holds for all i G N. 

Let a; G Z*, we denote by G Z" the infinite (countable) concatenation of x. 
Lemma 3. Let x, xi,X2, y, z,ti,t2,p, q £ E* with tit2,p, q primitive, then: 

7. ifti -< p and xpti — ypp then xp'^ — yp'^ 

2. ifxp^ = yp^ then 3a, /3 > : xp°' = yp^ 

3. ifx{tit2Y = y{t2tiY and ti ^ e, then 3a, /3 > : x{tit2)°' = 2/(^2*1)^*2 

4. ifxlht-iY = {t2ti)'^ andti ^ e, then 3a > : x = (*2ii)"<2- 

5. ifVi £ {1,2}, x^y{tit2)" = y{tit2)'^ then 3ai,a2 > 0,3^3,^4 G E* : Uti = 
tit2,Xi = {tih)°'^ 

6. ifxp'^ = p^ then 3a > : a: = 

7. //'3a > such that p"xp'^ = xp'^, then x £ p*. 

8. //'3a > 0, q'^yp^ — yp^ then qy ~ yp 

9. if 3a, /3, 7 > 1 such that x{tit2Yy{tit2Y z = (^2*1)'', then y £ {tit2)*. 

Proof. 1. Let t2 such thatp = tit2, then xtit2ti ~ ytit2tit2, by Lemma[T]<i = e or 
t2 — e i.e. either ti — e 01 ti — p. 

2. Direct consequence of the previous property since we have xp^ti — yp^ for some 
a, /3 > 1 and ti -< p. 

3. By applying the previous property to a;(tit2)" = J/*2(ii*2)'^- 

4. The second assertion is a direct consequence of the first when taking y — e. 
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5. It is clear if xi = X2 ^ e. Suppose that xi ^ e. Since Xiy{tit2)'^ — 2/(^1^2)", we 
also have xia:iy(iii2)" = 2/(^1^2)'^, and more generally, forall /3 > 1, {xi)'^y{tit2)'^ = 
^(^1^2)"- By taking f3 large enough, there exists 7 > such that (xi)^ and (iii2)^ 
have a common factor of length at most | + 1^1^21 ^gcd(|xi |, |tii2|)- By the fun- 
damental lemma, there exists ^3, G S* such that t^t^ is primitive, xi £ (^4^3)* 
and tit2 G (^3^4)*. Since tit2 is primitive, we have ^1^2 = ^3^4- Suppose that 

X2 7^ £• Similarly, we can prove that X2 = (^4^3)'' for some 7 > and t'^, t'^ such 
that tit2 = t3t4. We have Xiy{tit2Y = X2y{tit2Y , therefore = t'^t'^, and 
X2 e (^4*3)*- 

6. We have a;p" = so we also have pxp'^ — p", therefore xp^ = pxp'^ i.e. 
.xp = px, and by Lemma[T] a; G p*. 

7. We clearly have xp" — p"x therefore, by Lemma[Tl x G p*. 

8. We have q'^yp^ — yp^ , this implies that for any x > q^'^yp^ — yp'^. Therefore, 
there exist (3 > and ti < q with y — q^ti. Let t2 G i7* such that q = tit2, we 
have {tit2)"'^^ti ~ {tit2)^tip" . Therefore because \p\ = \q\ = |tit2| we have 
p = ^2^1- This concludes the proof. 

9. We assume ti,t2 ^ e (otherwise it is obvious). By [T] and |4] we have that x — 
{t2tiYt2. By the same argument we have z = <i(t2ii)'' So we have: t2(tii2)"^''y(iit2)^^''ii = 
lt2tiy'. Therefore y G {tit2)* . 

□ 

4 From Functionality to Word Equations 

Given some words uq, . . . w„, Um, Un, . . . , Mq G S*, fc G N, and a function vr : {1, . . . , fc} — > 
{1, . . . , n}, we denote by the word wow^(i) . . . UT^(j)UrnUjf(^jj . . . Ujf^i^Ug. We denote 
by idn the identity function on domain {1, . . . , n}. The following lemma states that if 
a word u translated into two words w is high enough, u, v and w can be decomposed 
into sub words that can be removed, repeated, or permutted in parallel in u, v and w, 
while preserving the transduction relation. 

Lemma 4. Let T be a VPT with N states, and n > 1. Let u,v,w G S* such that 
f, w G T(u) (u is thus well nested) and h(u) > nN"^. Then there exist Um, v„i, Wm G 
E* and Ui^Uj, Vi,Vj,Wi,Wj G S* for alii G {0, . . . , n} such that Uid„ — u, Vid„ — v, 
U!id„ = w and for all k ^ N and all tt : {1, . . . , fc} — > {1, . . . , n}: u^, w-^ G T^u^^) 
and Ui, uj ^ efor all i = 1, . . . ,n. 

Proof Let T be a VPT, with set of states Q. Let N ^ \Q\, n > I, and u,v,w G S* 
such that v,w E T{u) and h{u) > nN^. In particular, u is well nested. We denote by £ 
the length of the word u and write u ~ {aj)i<j<e, with aj G S for all j. There exists a 
position 1 < j < ^ in u whose height is equal to h{u). We fix such a position j. Then, 
for any height < fc < h{u), we define two positions, denoted a(fc) and /3(fc). a{k) 
(resp. /3(fc)) is the largest (resp. the smallest) index d, such that d < j (resp. d > j) and 
the height of u in position d is equal to fc. The part of the word concerned by mapping 
a (resp. (3) is represented in blue (resp. in red) on Figure |2] 

As u, w G T{u), there exists two runs Qy, on u in T which produce respectively 
the outputs V and w. We denote by {pi)o<i<£ (resp. {qi)o<i<i) the states we encounter 
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Fig. 2. Form of pumping 



along Qi, (resp. g^). As h(u) > nN'^, there exists two pairs of states {p,p'), {q, q') G 
such that 

{0 < fc < h{u) I pa(k) = P andp^(fc) = p' and q^^k) = q and q^K^k) > n 

We denote by < fci < . . . < fc„+i < the n + 1 different heights associated 
with the pairs {p,p') and {q, q'). For each i — 0, . . . , n — 1, this means that the two 
runs pass simuhaneously in states p and q before a call transition with a height equal to 
ki, and that the height of the stack will never be smaller than ki, until reaching again 
states p and q with a stack of height fci+i. A symmetric property can be stated for states 
p' and q' . As a consequence, we obtain n fragments which behave as synchronized 
"call loops" around p and q with corresponding "return loops" around p' and q' . This 
situation is described on Figure |2] 

Then, we can define the different fragments of u as follows: (see Figure|2]l 

- = ai . . . aa{kx)-i, 

- yi < i < n,Ui = a^iki) ■ ■ ■ aa{ki+i)-i, 

- U„i — aa(k^j^i) ■ ■ ■ ai3(k„^i)-i, 

- \/l <i <n,uj = ap(^ki+i) ■ ■ ■ a^(fc,)_i, 

- Ug = ap(^ki) ■ --ae. 

We immediately obtain u — Uid„ and Ui,u- ^ e for alH = 1, . . . , n. The decompo- 
sitions of V and w are obtained by considering the outputs produced by the correspond- 
ing fragments of u on the two runs and gw 

Finally, the property of commutativity ( v^^, w.^ G T{u-^) for all vr : {1, . . . , fc} — >■ 
{1, . . . , n}) easily follows from the fact that for each i G {1, . . . , n}, the fragments of 
the runs associated with Ui and u- do not depend on the content of the stack as T is a 
visibly pushdown transducer. □ 
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The following lemma states that if a word u with at least two outputs is high enough, 
there is a word u' strictly less higher with at least two outputs. 

Lemma 5. Let T be a VPT with N states and u e Dom{T) such that \T{u)\ > 1 and 
h{u) > 8N^. There exists u' £ Dom{T) such that |r(M')| > 2 and \u'\ < \u\. 

Proof. Let v,w G T{u) such that v ^ w. Thanks toLemmaS] there exist u™, Wm, Wm G 
S*, and for all i e {0, . . . , 8}, there exist u.;, uj, Vi, vj,Wi,wj ^ S* , such that Uidg = u, 
Vidg = V, Widg = w and for all /c e N and all tt : {l,...,fc} — 
VttjWt^ e T{ut^) and Ui^uj ^ e for all i = l,...,n. We prove that there exist 
k G {0, . . . , 7} and tt : {1, . . . , j} — > {1, . . . , 8} such that ^ and \uTr\ < 
\u\. We proceed by contradiction. Suppose that for all /c G {0, . . . , 7} and for all 
TT : {1, . . . , fc} — > {1, . . . , 8} such that \utj\ < \u\ we have Vtt = Wj^. This defines 
a system of equations S — {w^ = | tt : {1, . . . , fc} {1, . . . , 8}, < \u\}. We 
show in the next section that it implies v — w (Theorem|2]l. 

5 Word Equations 

In this section, we fix some n > 8, some words Um,Vm,Wm G and for all i G 
{0, . . . , n}, we fix Ui,Vi,Wi, uj, vj, wj £ U* such that Ui,uj ^ e. We consider the 
system S — {u^ — Wti \n : {1, . . . , fc} — > {1, . . . , n}, \ut^\ < \uid^\}- The main result 
we prove is the following: 

Theorem 2. IfS holds, then Vid„ — u)id„- 

We let £ G {1, . . . , n} such that \uiUj\ < \uiUj\ for all i G {1, . . . , n}. We consider 
several cases to prove Theorem|2l 

(1) \vi\ = \wi\ (2) \vi\ > \wi\ (3) \wi\ > \vi\ 

Cases 2 and 3 being symmetric, we consider cases 1 and 2 only in the two following 
subsections. 

5.1 Proof of Theorem 111 case l^^l > \wi\ 

We denote by iS[|ti£| > |] the system S with the assumption \ve\ > \wi\ and from 
now one we assume that this system holds. We consider the following set of equations, 
defined for all a,b > and alH G {1, . . . , n}: 

VoVmVQ = WoWmWQ (1) 
VQivi)''Vm{vj)"-VQ = WoiwtYWyniwjTw^ (2) 
VliVi{viYVm{vjYviV^ = WoWi{wiYWyn{wjYwjW^ (3) 
Vo{viYVi{vtYv,n{vjf'vj{vjYv^ = Wq (Wf)" (wj)''i(>j(wj)''iOq (4) 

For fc G {1, 2, 3, 4}, we denote by Sk the subsystem that of equations of type fc. For 
instance, ^2 is the system of equations {vo{vi)°-Vm{'Vj)°"VQ = WQ{we)"'Wm{wj)°"WQ \ a G 
N}. 
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Lemma 6. For aZZ fc e {1, . . . , 4}, 5^ holds. 

Proof. First, jwoUmUol ^ I UQUmU-^ ~ u^, where it is the function with empty 
domain. Since S holds by hypothesis, this equation holds. 

We prove that 1S4 holds, as ^3 is a particular case of ^4 and S2 is a similar but easier 
case. First, ^4 holds for all a, & G {0, 1, 2, 3}. Indeed, since n > 8, there are six pairwise 
different integers zi, . . . , ig € {1, . . . , n} such that ik ^ i for all fc G {1, . . . , 6} and 
6|u^U;j| + \uiUj\ < \uiU-\ + X]/c=i l^ifc^i^l < l"jd,J- Sccond, by Lemma|2] ^4 holds 
for all a G N and 6 = 0, 1, 2, 3. If we fix ao G N, it holds for a aq and 6 = 0, 1, 2, 3. 
Thus by Lemma|2]it holds for a — ao and all & G N. □ 

Proposition 1. For all i E {1, . . . , n}, \viVj\ — \wiWj\. 

Proof. This is implied by Si and 1S4 (with a — b = 0). □ 

Thanks to iSi , . . . , 1S4 we can characterize the form of Vi,Wi, wj for all i and prove 
a property on Wm- This characterization is then used to prove Vid„ = Wid^- Wlog 
we assume that = e or wq = e, and Uq = e or ufg = e. Otherwise we can remove 
their common prefixes in 5i , . . . , ^4. 

Lemma 7. If there exist k G {1, . . . , n} such thatWk 7^ £• TTzen there exist ti,t2,t3, G 
17*, ao, /3o 5^ 0, a^, jSj > Ofor all i G {1, . . . ,n} such that tit2 is primitive and for 
alii G {!,..., n\: 

and if Wo — e, f/ien fo = (^4^3)"°i4, andifvo = e, then wo — {t3t4)^°t3. 

Proof. First we infer the form of and wj. Since I W£ I > jw^l, by ^2, there is a > such 
that (ve)"" and (u^)" have a common factor of length at least \vi\ + \wj\ — gcd(|uf |, \wj\) 
(see Fig.|3]l. Therefore by Lemma[T]2, there exist ti,t2 G S* such that tit2 is primitive, 

ve = {tit2)°'^ Sindwj = (^2^1)'^" for some a^, > 0. 



Second we derive the form of Vi and for all i G {1, . . . , n}. As U£ / e there 
is 60 > 1 such that \{ve)^"~^Vra{vj)^°v-iVQ\ > \wq\. We consider ^4 with b = Bq. 
The size of the suffix V£Vi{vi)''°Vm{Vi)''°Vi{vj)"'Vo is of the form li{a) = fci + a\tij\ 
and the size of the suffix {wj)°'TjifQ is of the form ^2(0) — k2 + a\wj\. As > \vj\ 




first word 




second word 



Fig. 3. System ^2 for large values of a, case \vi\ > \we\. 
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(by Proposition [T] and \vt\ > \wi\), there exists oq > 1 such that ^2(00) > ^i(ao)- 
Therefore (see Fig. Hjl veViVe is a factor of (wj)"-". Thus there is X,Z G S* such that 
X{tit2)°"v^{tit2)°'<'Z = {t2tiY'>^. Since a^,^y > 0, we can apply Lemma [3]9] and 
we get Vi G (iit2)*- Since \'Wj\ > \vj\ and wj = {t2ti)^, by symmetry, we also get 
e it2ti)*. 



\/////////////A 



\ y'////////////A 



end of the first word 



end of the second word 



Fig. 4. System 55 for value fog ™d large values of a, case \vi\ > \wi\. 



Third we determine the form of the words Wi and prove the property on Wm- Since 

= {tit2)°'', wj = {t2ti)^^ and Vi = (tit2)"' for some > 0, ^2 and ^3 can be 
rewritten as follows: 

«0(ili2)"-"'Wm(wi)X = Wo{Wi)''Wrnit2hr^'W^ 

Since > there exist a, f3, 7, 7' > 2 and t', t" -< tit2 such that 

VoihhTt' = Wo{weYwrnt2{tlt2V Vo{tlt2Tt" = (w^ Wmt2 1*2 

By Lemma ism we get wo(*ii2)" = woi'^i(u'£)'^'"^m(*2*i)'^ and 

W0(ilt2)" = Wo(u;^)'^W™(t2tl)" (1) 

Therefore 

iwefwmit2tl)'^ =W^{wifwm{t2tir (2) 

Eq.|2]is equivalent to (wf )^u;,„t2(ii*2)" = )'^Wmi2(*i*2)'^. thus by Lemma 

I3I5I there exist ^3,^4 G 17* such that <i<2 = ^3*4 and for alH G {1, . . . , n}, = 
(^4*3)'^' for some (3i > 0. By hypothesis, there is fc G {1, . . . , n} such that Wk ^ e, 
and therefore /3fe > 0. Eq.|2]gives (i4t3)'^' Wm(i2ii)'^ = (i4i3)'''+^'"Wm(t2tir' i-e- 
Wm(i2ii)" = {t4:UY'"Wm{t2tiY . By lemma|3l8]we get Wmt2ti = tihwm- 

Finally, we determine the form of Wo and Wo -If Wo = e, thenEq.[T|gives wo(tit2)" = 
(*4i3)''-^'=w'm(*2ii)"- Since tit2 = hU and ^4*3^™ = Wmt2ti, VQ{t:iUY = \uhY . 
Wlog we can assume that ^3 ^ e. Indeed, G (^1*2)* is non-empty and tit2 = t^ti, 
so that ^3*4 7^ e. By Lemma [3l4l uq G (i4i3)*t4. Alike, if vq = e, then wlog we can 
suppose that t4 7^ e, and conclude similarly that wq G {tztiY't^,. □ 
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The mirror of a word t ^ S* is denoted by t and is inductively defined hye — e, 
ct — tc for all c <E S. The mirror of an equation t = t'ist = t' . By taking the mirror of 
the equations Si, ... , S^, we obtain a system of equations which has the same form as 
5i, . . . , 54. Since \vi\ > jw^ |, by Prop.[Tl \'Wj\ > \vj\. Therefore we can apply Lemma 
|2]on the mirrors of 5i , . . . , ^4 and obtain the following corollary: 

Corollary 1. If there exist k £ { 1 , . . . , n} such that vj: =/= e. Then there exist ti,t2,t^,tQ e 
E*, ao, /3o ^ 0, ai, l3i, j3j > Ofor all i e {1, . . . , n} such that ^2^1 is primitive and for 
all i e {1, . . . , n}: 

andifw^ ~ e, then — ^5(^6^5)"'^. and if ~ e, then Wq = ^6(^5^5)'^° 
We are now equipped to prove that Vid^ = wid^ : 

Theorem 3. S[\vi\ > | |] =^ Vq. .. VnVmVn . . . Uq = WO ■ ■ ■ WnW^Wn ■■ - Wq 

Proof. We consider several cases: 

1. there exist k,k' G {1, . . . , n} such that Wk' ^ e and t. 

By Lemma|2] there exist ti, t2, ^3, ^4 G S* and ao, /^o, ■ • ■ , /^y > 

such that: 

and if = e, then uq = (i4t3)""t4, and if = then wq = (i3i4)*'t3 

By Corollary [T] and the fact that a word is uniquely decomposed as a power of a 

primitive word, there exist <5 , G E* and . . . , > such that: 

and if Wq = e, then = ^5(^6*5)"'''. and if = e, then iv^ — iel^s^e)''" 
We can also suppose that vq = {t3t4)"° = {tit2)"° and wq = (i3t4)^''t3. Indeed, 
if Wo = e, we simply replaced vq by t^vo and by t^WQ. Similarly, we assume 
that (^6*5)'^" and = hihUT". By PropIT] a, + aj = (5, + /?j for all 

I G {1, . . . , n}. Finally: 



= {tit2T°+^'+-+^-Vm{hte)^-+^+"n5 (since a, + a- = ft + ft and 

tlt2Vm = Vmhte) 

= {ht2f^ + -+f'-V0V,n^{UhY^+-+PT 

= itit2)^'+-+^-woWmmo{t6h)^+-+'^ (by5i) 

= (tlt2)''^ + -+''"(i3i4)'^°t3Zi^™(i6t5)''«(t6i5)^-+-+''- 

= {t3U)'^''+'^' + -+'^"t3Wm{t2tl)^+-+f^+'^ (as ^1^2 = ^3^4 and t2ti = ^6^5) 

= WqWi . . . WnWrnWn • ■ • WjWq □ 
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2. for all k G {1, . . . , n}, Wk ^ vj: ~ e. As in the proof of Lemma |7] we can char- 
acterize the form of Vi and mj for alH £ {1, . . . , n}. In particular, there exists 
<i,<2 G such that tit2 is primitive and Vi = (^1^2)"' for some ai > 0, and 
wj — {t2ti)^ for some /3i > 0. By Proposition [T] = /3i for all i. We let 
Wq — woWm and — v„iVq. The systems Si,S2 can therefore be rewritten as 
follows: 

f VqV^ = W'„W^ (1) 

1 vo(iit2)""'«^ = ^(taii)""^^'!! (2) 

Wlog, we can assume that vq — e or w'q = e. Both cases are symmetric, so that we 
consider only the case vq = e. Wlog we can assume that ti ^ e. By Lemma [3P1 
and 1S2, we get w'q = {tit2)"ti for some a > 0. Therefore: 

VqVi . . . VnVrnVn ■ ■ ■ ViVq 

= (iii2)"^+-+""«^, 

= (tii2)"^+-+""«^(,u^by5i 

= (tit2)"i+-+""+"tiu>o 

= M;oM;„(t2ii)"^+-+""i% 

= WqWi . . . WnWmWn ■ ■ ■ WjWo 

3. for all fc e {1, . . . , n}, vj — e and there exists p € {1, . . . , n} such that Wp ^ e. 
By Lemma|7] there exist ii, i2, ^3, ^4 G ^* and fceioo and ai, Pi, f3j > for all 
i G {1, . . . , n} such that ^1^2 is primitive and for all i e {1, . . . , n}, tit2 = t^t^, 
tihwm = Wmt2ti, Vi = (^1*2)"% = {tih)^^ and = {t2ti)^. Moreover, if 
Wo = then = {t4t3)°'°t4, and if vq — e, then wq = itzt4)^°t-i. By Proposition 
[T] since vj = e for all A; G {1, . . . , n}, we get ak = /3k + (i-j:- As for the case given 
in the paper, we can suppose that tiQ = (^3^4)"" = (^1^2)"" and wq = (i3i4)'^°i3- 
Indeed, if wq = we simply replaced by t^VQ and by t^WQ. Finally: 

VqVi . . . VnVmVn ■ • • % 

= itit2r°+-+"-v^vo 

= itit2)"' + -+""V0V^Vjj 

= {ht2r'+-+"-woWmW^hySi 

= WQ{t4t^Y^ + '"+l^"Wm{t2tl)^^"'^'^V0QS\nCe, UtsWm = Wmt2tl 
= fwoWi . . . WnWrnWj . . . WnW^ 

4. for all fc G {1, . . . , n}, Wk — e and there exists p G {1, . . . ,n} such that Vp e. 
This case is symmetric to case 2. 



5.2 Proof of Theorem 111 case \vi\ — \wi\ 

Remind that we have fixed some n > 8, some words Um, Vm, Wm £ S* and for all 
i G {0, . . . , n}, we have fixed Ui,Vi,Wi,uj, vj,ivj G S* such that Ui,uj ^ e such that 
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the following system holds: S — {v^^ = Wir | tt : {1, . . . , fc} — > {1, . . . , n}, \ut^\ < 
Consider the following equations, defined for all a G N, for alH, fc G {I, . . . ,n}: 

VQVmVQ = WQWjnVUg (1) 

Voiv()''Vmivj)''VQ = Wii{wiYWm{wjYv}Q (2) 

VoVi{vtYVra{vjYvjVQ = WoWi{wtYWrn{wjYwjWQ (3) 

VoVrVkiviYVmivjyih^VjV^ = WoWiWk{wiYWrn{wjY'>^'^'U^ (4) 

. . .Vi^lVt+l . ■ . VnVmVn ■ ■ ■ ^^Z^T^^ . . . ^ Wq . . . Wt-lWt+l ■ ■ ■ W„W,„?«^ . . . Wj—^Wj^ . . . Wq (5) 

As done for the case \ve\ > |w^|, we denoty by Sk the set of equations of type k, 
k = 1, ... ,5. As for the equations given in the paper for the case \ve\ > |?«^|, we can 
prove similarly the following proposition: 

Proposition 2. For all k = 1, . . . ,5, Sk holds. 

As for the case \vi\ > jw^l, we have the following proposition (which is in fact 
indepent from the cases \vi\ = jw^j or not): 

Propositions. For alii G {l,...,n}, \viVj\ — \wiWj\. 
Case study There are four cases: 



(0 


\M 


= \wi 


= and 


"I 




0; 


{^^) 


\M 




^ and 


Vj 


= 


= 0; 


(Hi) 




= \Wi 


and 






^0; 


(iv) 






^ and 






0; 



Cases [iv) is syntactically the same as case (m) if we consider the mirror of the 
equations. Therefore we consider only case and [iii). For each of those three 

cases, we prove that Vid„ — Wid^ (Theorem|2|i. 

Similarly as the case \vg\ > \wi\, we can assume wlog that vq = e or wq = e, 
and i;g = e or Wq = e, otherwise we remove their common prefixes in the systems 
<Si, . . . , S5. 

Subcase \v£\ — = \vj\ — \wj\ — 

Lemma 8. If\ve\ = \we\ =0 and Ivjl = \wj\ = 0, then Vid„ — Wid„. 

Proof. It is an obvious consequence of 55. □ 

Subcase \vi\ — \w£\ ^ and \vj\ — \wj\ ^ 

Lemma 9. There exist ti, ^2 G ^* such that tit2 is primitive and ao, l3o,ai, /3£ > 
such that: 



vi = {tihY' m^{t2tiY' Wo ^ e vo ^ {t2tiY°t2 vo = e ^ wo ^ {tit2f"ti 
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Proof. Remind that by hypothesis, vi ^ e. Then wg ^ e. By ^2, there exists a > such 
that {ve,Y ™d {wt)"" have a common factor of length at least + gcd(|w^|, \wt\). 
By the fundamental lemma, there exist ii,t2 6 such that tit2 is primitive, G 
(^1^2)^ and e (^2^1)^- We now infer the form of vq when wq = e (the form of wq 
when Vq — e can be obtained by symmetry). Wlog, we can assume that ti 7^ e. Indeed, 
since ^ e, we have ^1^2 ^ e, so that if ti = e, then we take t[ = ^2 and ^3=^1= e, 
and we have e [t'lt'^)'^ and wi G (i2i'i)+- By ^2, we get uo(iii2)'^ = [hhT ■ By 
Lemma[3]3] uo = {hti)"'"^ for some Q!o > 0. □ 

Since by hypothesis we have \vj\ = \ijuj\ ^ 0, by considering the mirror of the 
equations, we can prove the following corollary of Lemma|9] 

Corollary 2. There exist t^^ti G S* such that t^t^ is primitive and a-Q, /3g, aj, /3j > 
such that: 

Under certain conditions, we can characterize the form of u,;'s and Wi's: 

Lemma 10. If there exists 1 < k < n such that \ vk\ ^ |wfc| then there exist ai, . . . , a„, . . . , /3„ > 
such that for all i ^ k: 

Proof. There are two cases: either — e or wq — £■ We consider the second case 
only, the first being symmetric. By Lemma|9] Vq — (i2^i)"°^2 for some ao > and 
ti,t2 G with tit2 primitive. By 53 and 54, we have: 

(1) VoV^{tit2r = W,{t2tir (2) VoVu{tit2y = Wk{t2tlY (3) VoVkV^{ht2Y = WkW,{t2tiY 

We again consider two cases: 

1. VoVk = WkW for some w. S2 gives w{tit2)'^ = (i2^i)'^- By Lemma I3l3l w = 
{t2t\Yt2 for some /3 > 0. ^3 gives wVi{tit2Y — u'i(^2^i)", and by Si, we get 
wv^{ht2Y = vov,{tit2y, i.e. {t2ti)h2V,{tit2Y - (t2t 1 ^2 v^ 1^2 Since 
\vk\ 7^ \wk\ and v^Vk — WkW, \vo\ ^ and /3 ao. Thus by taking 7 ~ 
\ao - /3| > 0, we get (tii2)^w,(tit2)" = v,{tit2)'^. By Lemma[3l8l G {tit2)*. 

2. Wk — voVkV for some v e. S2 gives (^1^2)" — v{t2tiY, i.e. (^1^2)" = 
vt2{tit2)'^ ■ Therefore by Lemma I3l6l vt2 G (tit2)"- Since w ^ e, we get v = 
{tit2)^ti for some 77 > 0. Now, ^3 gives Vi(tit2)" — vwi{t2ti)'^ , and by Si, 

V^{ht2r = VVoV^{ht2r = {tlt2)ni{t2h)''H2V.,{tit2r = (^1^2 + ' f» (^1^2 • 

Bv Lemma|3l8l G (iit2)*- 

In both cases Vi G (^1*2)*- By iSi voVi{tit2Y = (^2^1)" = Wi{t2ti)" and by Lemma 
[3l6]w, G {t2ti)*. 

Again by considering the mirror of the equations, we can prove the following corol- 
lary of LemmafTOl 
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Corollary 3. If there exists I < k < n such that \vj:\ ^ \wj\ then there exist aj, . . . , I3j, . . . , /3?r > 
such that for all i ^ k: 

vj = mur^ wj = (uhf^ 

Lemma 11. Let a G N. If for all i e {1 . . . n}, \vi\ — \wi \ and there exist at, hi G N 
such that: 

{t2tlTt2V,{tit2T = W,{t2tlfH2 (3) 

then 

{t2tlTt2Vi . . . W„ = Wi . . . Wn{t2tl)°'t2 

Proof. From Eq|3] and \vi \ — | | we deduce that hi = a + a^, so that: 

{t2tlTt2V, ^ W,{t2tiTt2 (4) 

By induction on n we show that (t2ii)"^2Wi . . . u„ = wi . . . w„(i2^i)"i2- Indeed, 
it is trivial if n = 0. So suppose it is true for n — 1, we have: 

{t2tl)°'t2Vi ...Vn 

= wi . . . Wn-i{t2ti)°'t2Vn (by induction hypothesis) 
= wi...wnit2ti)''t2 (by®) 

□ 

Proposition 4. One of the following propositions holds: 

1. Vz e {1, . . . , n} : = {tit2T^ A = (^2*1)^' A v- = (^3*4)"" A w- = {Uts)!^ 

2. 3A; S {1, . . . , n}Vi ^ fc : = and = 

Proof. Indeed, if there are fc 7^ fc' such that 7^ jiufel and 7^ l^fe'l, then by 
Lemma [TOl Vz : — (^1^2)"' A = (t2t\)^' ■ By Proposition [3] |iyjr| and 

1^1' I 7^ l""^!;' I so that by Corollary|3] for all «, "fj = (^3^4)"" and toj — (i^ij)^. 

Otherwise we have at most one k with \v^^^ ^ \w]~\, and for all i 7^ fc, = \wi\, 
and by Prop.|3] = □ 

We now prove Theorem |2] for each of the cases of Prop. |4] This is done in two 
lemmas: Lemma [T2] and Lemma [T3] 

Lemma 12. If for all i e {1, . . . , n}, Vi — (^1^2)"% Wi — (^2^1)'^% "^^ — (^3*4)"" and 

Wj = {tAts)^, then Vq . . . VnVmVn . . . % = Wo ■ • • WnWmWn . . ■ Wq. 

Proof. First by Lemma|9]and Corollary |2l we have: 

Wq = €^Vo e {t2ti)*t2 Vo = Wo £ {tlt2)*ti 

Since vq — e or: wq — e, and = e or iv^ — e, we can assume wlog that vq = 
(^1^2)"° and Wo — {tit2)^°ti for some ao,Po > 0. Indeed, if wq — e, we simply 
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replace inSi, . . . jS^voby tivo and wq by tiwo (which is indeed of the form {tit2)*ti). 
If vo = e, then it is of the form {tit2)* and Wq is of the form {tit2)*ti- 

Similarly, we can assume wlog that Vq = {t3t4)"o and Wq = {1413)^014, for some 

Now, by Si and <S2, we have: 

VoVeVmVjVQ = VjQWeWmWjWQ 
VoVuiVq = WQWm1i>o 

So we deduce: 

VoVeVraVfVQ 
^ {tlt2r°+"'Vm{t3t4r^+''° 

^ {tlt2r {tit2rvm{t3t4r°{tst4r^ 

^ {tlt2)"'VoVmVo{t3t4r' 

^ itit2r+^HiWmt4{ht4r^+^^ 
^ Wo{tlt2)°'^Wm{t3t4T~U>o 
^ {Ht2T'Wm{ht4T^ 
^ {tlt2r'Wm{t3t4r^ 

Then we conclude with: 

VQVi...VmVn...VjV^ 

= {tit2T'+-+''-vMir^'"^"' 
= {tlt2)"^Vo . . . Ve-lVe+i . . . VnVmVw ■ ■ ■ ■f>f+TV^(i3i4)"^ 

= {tit2)"^wo . ■ . we-iWe+1 . . . WnWmWn ■ ■ ■ tt>£q:rw^(i3i4)"^ by ^5 

= (tlt2)^" + -+^'-^+^^ + ^-''"(tlt2)'^'«^m(t3t4)''n^3i4)^-+-''— +''^-+'^- 
= WqWi . . . WnWmWn ■ ■ ■ Wq 

□ 

Lemma 13. If there exists 3k G {1, . . . , n} such that for all i ^ k, \vi\ = \wi\ and 

|^^■| = \wj\, then Vq . . . VnVmVw ■..Vq = Wo... WnWmWn ... 1%. 

Proof. By hypothesis, we have assumed that vq = eorwo = e, and Vq = e or Wq = e. 
This leads to four cases: 

1. Wq = e and VQ = e; 

2. Vq = e and Vq = e; 



= WoWeWmWfU^ 

= WoWtWmWjWQ 

= {tlt2f'Wm{t3t4)^ 
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3. vq ~ e and — e; 

4. wq = e and = e. 

We have assumed that \vi\ ~ \we\ and |t;^| = \w-j;\ ^ 0, and there is k such that 
for all i ^ k, \vi\ — \wi\ and \vj\ = |wj|. This assumption is symmetric, so that with 
respect to the systems Si, ... ,5^, cases 2 and 4 are symmetric, and case 1 and 3 are 
symmetric. Moreover, the proofs of cases 1 and 2 are very similar, therefore we focus 
on case 1 only. 

From now one, we assume that wo = e and Vq = e. By S3 and Vi = {tit2)"^ 
(Lemma |9]l we have woWfc(iii2)" = Wk{t2'ti)'^ ■ Wlog we can assume that ti ^ e. 
Therefore by Lemma [3l3] there exist ak,bk such that VQVk{tit2)°''' — Wk{t2ti)^''t2, 
equivalently we consider two cases we suppose that either = or that 7^ 0, 6^ = 
i.e. either voVk = wu{t2tif''t2 or voVk{tit2T''~'^ti = Wk. 

- Case voVk = Wk{t2ti)'"'t2: 

First, we know that | | — \wi\ for alH < fc and that there are ,bi e N with 
voVi{tit2)°'^ = Wi{t2tif'H2 (by 53 and Lemma |3l3l l where vq — {tit2)°'°t2, so by 
Lemma nn we have: 

voVi...Vk-i =wi...Wk-iVo (5) 

Second we have voVk — Wk{t2tif''^t2 by hypothesis (the case we are considering). 
Third, again by ^3 and Lemma I3l3] we know that \vi\ = \wi \ for alH > fc and that 
there are a-, 6- g N with voVkVi{tit2)"'' ~ WkWi{t2ti)^'t2 i.e. by replacing voVk 
with Wk{t2ti)^''t2 we have {t2txf''^t2Vi{tit2Y^ — Wi{t2tif'it2, so by Lemma [TTI 
we have: 

{t2txfH2Vk+l . . . W„ = Wk+1 . . . Wn{t2tlT''t2 (6) 

As a consequence we have: 



Vo...Vn 
= Wo . . .Vk-lVkVk+1 ...Vn 
= Wi... Wk^lVQVkVk+l ...Vn 
= Wl . . . Wk-lWk{t2tl)'"'t2Vk+l ...Vn 
= Wi... Wk-lWkWk+1 . . . Wn{t2tlf"'t2 

= Wi...Wn{t2tl)'"'t2 (7) 

- Case VQVk{tit2)'^ti = Wk- We can show that vq . . .Vn{t2'tiY*'^^t2 — wi . . .Wn 
with a very similar proof. 

By symmetry (since v- e and w- ^ e), we have either tj,{t^t'i)'^^v^ — wj^q or 

V^ = t4{t3tiy''WjWQ. 

We conclude the proof by putting this together and showing that vqVi . . . VmVn . . .v- — 

Wi . . .WnWmWn- . - Wq-. 
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- Subcase t^{t4t^Y'"v-^ = wj^Wq'. this implies that i3(i4t3)'^'°w?r . . .vj — Wn . . . w^. 
Moreoverwe know that Wo WfeWm^'ir = WkWmW-^WQi.e.. {t2ti)^''t2Vm — WmH{titzY ■ 
We can deduce: 

VqVi . . . VraVn . . . Uj- 
= Wi . . .Wn{tit2f''tiVmVn ■ ■ .Vj 
= Wi . . .WnW„it'i{tit:iY''Vn ■ ■ .Vj 
= Wi . . . WnWraWn ■ ■ . Wq 

- Subcase v-j: = t4{t3t4ywjrwQ: this implies that Vn--.vj — t4(tst4ywn . . .Wq. 
Moreover we know that VQVkVmVj; — WkWmWjrw-Q i.e. (i2^i)''i2Wmt4(i3i4)^ = 
Wm- We can deduce: 

VqVi . ..VmVn...Vj 
= Wi...Wn {tlt2ftiVmVn . . . Vj 
= Wi . ..Wn{tlt2ftiVmt4{ts,tifWn...'WQ 
= Wi.. .WnWmWn- ■ ■ 

□ 

Subcase = \w£\ ^ and \vj\ — \wj\ — Similarly as Proposition|4] one can 
prove the following proposition: 

Proposition 5. One of the following propositions holds: 

1. V^ e {1, . . . , n} : = (iit2)"' A = (^2^1)'^' 

2. 3k e {l,...,n}Vi ^ k : \v,\ = 

Lemma 14. If\vi\ — \wi\ ^ and \vj\ — \'wj\ — 0, then Vid„ — Wid„. 

Proof. Let pose Vi = vq . . . vi^ivi+i . . . w„, resp. Wi = wq . . . Wi^iwi-^i . . . Wn, and 

V = VmVn ■ ■ ■ Vj+iVj-i • • • % = VmVn . . . Vq, rCSp. W = W^Wn . . . WJ^-^^WJ_^ . . . Wq = 

WmWn ■ ■ ■ 'Wq- By ^5 wc havc V\V — W\W . We can suppose wlog that W\ — V\W' , 
i.e. we have: 

Y = W'W (8) 

Now let V2 = Wo . . . Un and W2 — wq . . .Wn- We have Vid„ — V2V and Wid„ = 
W2W. We will show that W2 — V2W' . This will conclude the proof as with Eq.|8]we 
have v,d„ = V2V = V2W'W = W2W = w,d„- 

First note that Lemma |9] is valid in this context and therefore we have wq = e ^ 
Vo G {t2t\)*t2 and wo = e wo e (^1^2)*^!, as above we can consider that vq = 
{t2tiYH2 and Wo = {t2ti)^° ■ 

We consider two cases following Proposition|5] 

1. Vi e {1, . . . , n} : Ui = {tit2)"' Awi^ (^2^1)'^': Let write a = ao + • • • + 

a£ - l + a£+l^ ha„ and/?oH 1-/3^ -l + (3l + l-\ h/3„ wehave Vi = 

Vq... Vt^iVe+i ...Vn = (<2il)"i2 and Wi = Wq... Wi-iWe+i ...Wn = {t2tlY, 

therefore W = {t2ti)"-^t2. Moreover ^2 = 14(^1^2)"' and W2 = W^2(^l^2)"^ 
as a result W2 = V2W'. 
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2. 3k £ {1, ... , n}yi ^ k : \vi\ — \wi\: By using the same construction as for Eq. 
|2]of Lemma [T3l we can show that there exists ak such that W' — {t2ti)°'''t2 with 
Wi = ViW and W2 = V2W'. 

a 

6 A PS PACE algorithm for functionality 

We now show how the pumping lemma for functionality can be used to decide function- 
aUty in PSpace. It relies an NLogSpace algorithm for functionality of FSTs, which 
is a consequence of the following pumping argument by Schiitzenberger: 

Theorem 4 (Schiitzenberger, 1975 Iflll ). Let T be an FST with m states. IfT is non- 
functionnal then there exists a word w of length at most 3 * m? that admits two different 
outputs. 

As a consequence, we obtain: 

Theorem 5. Functionality o/FSTs is decidable in NLogSpace. 

Proof We give a Co-NEogSpace algorithm. The result follows as Co-NEogSpace 
= NEogSpace. 

Note that each transition outputs a sequence of letters of bounded length, therefore 
one can bound polynomially the length of the two different outputs for a single input 
that witnesses non-functionality. Eet us point out that two outputs differ either because 
one is a strict prefix of the other, or on a common position their letters differ. By a small 
trick and a new dummy symbol in the input alphabet, it is easy to reduce the first case 
to the second one with an augmentation of the FST of constant size. 

We consider a non-deterministic algorithm for deciding non-functionality, operat- 
ing as follows: one guesses a position i in the output where two outputs differ. Then 
using only logarithmic space, one can check that this guess is correct. At each step, 
this algorithm guesses itself one letter of the input and the two transitions of the two 
runs computing the two different outputs. Therefore at each step, this algorithm keeps 
two counters and the two states reached by the two runs so far. The first (resp. second) 
counter counts the length of the first (resp. second) output. When one of the outputs has 
reached position i, the algorithm stores the i-th letter of this output, and continue until 
the other output reaches the i-th position. At this point, the two runs are in two states 
p, q, and one just has to check whether the two letters at the i-th position are different. 
Finally, the algorithm checks whether the two runs can be continued into successful 
runs (from p and q) on the same input. This can be again done in non-deterministic 
logarithmic space. 

By Schiitzenberger's Theorem, one can take i < 3m^, and therefore the two coun- 
ters are represented in logarithmic space in the size of the FST. □ 

We can now give a PSpace algorithm for functionality. We devise a construction 
which given a VPT A, builds an FST B that simulates A for nested input words of 
small height. The height of the input word being polynomially bounded (EemmaHJ, 
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one can bound similarly the height of the stack of the VPT. Then, as runs cross only 
finitely many stacks, one can incorporate these stacks into a finite-control part, turning 
the VPT into an FST. This construction is correct in the following sense: 

Proposition 6. For all VPT A with n states, one can construct an FST B of expo- 
nential size wrt n, such that Dom{B) = {u G Dom{A) \ h{u) < 8n'^} and for all 
w G Dom{B), B{w) = A^w). Moreover, A is functional iff B is functional. 

The idea is to apply the NLogSpace algorithm of Theorem |5] on B. However, 
building this FST B of exponential size wrt to the size of the VPT A as the first step 
of an algorithm will not yield a PSPACE algorithm. Therefore, the construction of the 
transition rules of B has to be performed on-demand when such a transition is needed. 
Altogether, this gives a PsPACE algorithm for deciding functionahty of VPTs. 
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